Haiyang Xu

STAMP: Training Explicit Memory for Mobile GUI Agents in Controllable and Scalable Virtual Environments

May 28, 2026
Via arXiv

ToolCUA: Towards Optimal GUI-Tool Path Orchestration for Computer Use Agents

May 12, 2026
Via arXiv

SemLayer: Semantic-aware Generative Segmentation and Layer Construction for Abstract Icons

Mar 25, 2026
Via arXiv

CyCLeGen: Cycle-Consistent Layout Prediction and Image Generation in Vision Foundation Models

Mar 16, 2026
Via arXiv

Mobile-Agent-v3.5: Multi-platform Fundamental GUI Agents

Feb 15, 2026
Via arXiv

AgentOCR: Reimagining Agent History via Optical Self-Compression

Jan 08, 2026
Via arXiv

CVP: Central-Peripheral Vision-Inspired Multimodal Model for Spatial Reasoning

Dec 09, 2025
Via arXiv

MonkeyOCR v1.5 Technical Report: Unlocking Robust Document Parsing for Complex Patterns

Nov 16, 2025
Via arXiv

Efficient and Effective In-context Demonstration Selection with Coreset

Nov 12, 2025
Via arXiv

Learning Filter-Aware Distance Metrics for Nearest Neighbor Search with Multiple Filters

Nov 06, 2025
Via arXiv